3 <110> APPLICANT: E.I. duPont de Nemours and Company, Inc.

4 Meyer, Knut 

5 Viitanen, Paul 

6 Van Dyk, Drew E. 

8 <120> TITLE OF INVENTION: High Level Production of P-Hydroxybenzoic Acid in Green Plants

10 <130> FILE REFERENCE: BC1015 US DIV 

12 <140> CURRENT APPLICATION NUMBER: US 10/718,311A

13 <141> CURRENT FILING DATE: 2003-11-20 
15 <160> NUMBER OF SEQ ID NOS: 18

17 <170> SOFTWARE: PatentIn version 3.4

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 32 

21 <212> TYPE: DNA 

22 <213> ORGANISM: artificial sequence 

24 <220> FEATURE: 

25 <223> OTHER INFORMATION: Primer 

27 <400> SEQUENCE: 1

28 ctactcattt catatgtcac accccgcgtt aa 32 

31 <210> SEQ ID NO: 2 

32 <211> LENGTH: 34 

33 <212> TYPE: DNA 

34 <213> ORGANISM: artificial sequence 

36 <220> FEATURE:

37 <223> OTHER INFORMATION: Primer 

39 <400> SEQUENCE: 2

40 catcttacta gatctttagt acaacggtga cgcc 34 

43 <210> SEQ ID NO: 3 

44 <211> LENGTH: 495 

45 <212> TYPE: DNA 

46 <213> ORGANISM: Escherichia coli 

48 <400> SEQUENCE: 3 

49 atgtcacacc ccgcgttaac gcaactgcgt gcgctgcgct attgtaaaga gatccctgcc 60 
51 ctggatccgc aactgctcga ctggctgttg ctggaggatt ccatgacaaa acgttttgaa 120 
53 cagcagggaa aaacggtaag cgtgacgatg atccgcgaag ggtttgtcga gcagaatgaa 180 
55 atccccgaag aactgccgct gctgccgaaa gagtctcgtt actggttacg tgaaattttg 240 
57 ttatgtgccg atggtgaacc gtggcttgcc ggtcgtaccg tcgttcctgt gtcaacgtta 300 
59 agcgggccgg agctggcgtt acaaaaattg ggtaaaacgc cgttaggacg ctatctgttc 360 
61 acatcatcga cattaacccg ggactttatt gagataggcc gtgatgccgg gctgtggggg 420 
63 cgacgttccc gcctgcgatt aagcggtaaa ccgctgttgc taacagaact gtttttaccg 480 
65 gcgtcaccgt tgtac 495 

68 <210> SEQ ID NO: 4 

69 <211> LENGTH: 165 

70 <212> TYPE: PRT 
71 <213> ORGANISM: Escherichia coli
73 <400> SEQUENCE: 4 

75 Met Ser His Pro Ala Leu Thr Gln Leu Arg Ala Leu Arg Tyr Cys Lys

76 1 5 10 15

79 Glu Ile Pro Ala Leu Asp Pro Gln Leu Leu Asp Trp Leu Leu Leu Glu

80 20 25 30

83 Asp Ser Met Thr Lys Arg Phe Glu Gln Gln Gly Lys Thr Val Ser Val

84 35 40 45

87 Thr Met Ile Arg Glu Gly Phe Val Glu Gln Asn Glu Ile Pro Glu Glu

88 50 55 60

91 Leu Pro Leu Leu Pro Lys Glu Ser Arg Tyr Trp Leu Arg Glu Ile Leu

92 65 70 75 80

95 Leu Cys Ala Asp Gly Glu Pro Trp Leu Ala Gly Arg Thr Val Val Pro

96 85 90 95

99 Val Ser Thr Leu Ser Gly Pro Glu Leu Ala Leu Gln Lys Leu Gly Lys

100 100 105 110

103 Thr Pro Leu Gly Arg Tyr Leu Phe Thr Ser Ser Thr Leu Thr Arg Asp

104 115 120 125

107 Phe Ile Glu Ile Gly Arg Asp Ala Gly Leu Trp Gly Arg Arg Ser Arg

108 130 135 140 

111 Leu Arg Leu Ser Gly Lys Pro Leu Leu Leu Thr Glu Leu Phe Leu Pro 

112 145 150 155 160 

115 Ala Ser Pro Leu Tyr 

116 165 
119 <210> SEQ ID NO: 5 

120 <211> LENGTH: 39

121 <212> TYPE: DNA 

122 <213> ORGANISM: artificial sequence 

124 <220> FEATURE: 

125 <223> OTHER INFORMATION: Primer

127 <400> SEQUENCE: 5 

128 ctactcactt agatctccat ggcttcctct gtcatttct 39 

131 <210> SEQ ID NO: 6 

132 <211> LENGTH: 32 

133 <212> TYPE: DNA 

134 <213> ORGANISM: artificial sequence 
136 <220> FEATURE: 

137 <223> OTHER INFORMATION: Primer
139 <400> SEQUENCE: 6

140 catcttactc atatgccaca cctgcatgca gc 32 

143 <210> SEQ ID NO: 7 

144 <211> LENGTH: 684 

145 <212> TYPE: DNA 

146 <213> ORGANISM: artificial sequence 

148 <220> FEATURE: 

149 <223> OTHER INFORMATION: Chimeric gene encoding chloroplast-targeted CPL fusion protein

151 <400> SEQUENCE: 7 

152 atggcttcct ctgtcatttc ttcagcagct gttgccacac gcagcaatgt tacacaagct 60 
154 agcatggttg cacctttcac tggtctcaaa tcttcagcca ctttccctgt tacaaagaag 120 
156 caaaaccttg acatcacttc cattgctagc aatggtggaa gagttagctg catgcaggtg 180

158 tggcatatgt cacaccccgc gttaacgcaa ctgcgtgcgc tgcgctattg taaagagatc 240 

160 cctgccctgg atccgcaact gctcgactgg ctgttgctgg aggattccat gacaaaacgt 300 

162 tttgaacagc agggaaaaac ggtaagcgtg acgatgatcc gcgaagggtt tgtcgagcag 360 

164 aatgaaatcc ccgaagaact gccgctgctg ccgaaagagt ctcgttactg gttacgtgaa 420 

166 attttgttat gtgccgatgg tgaaccgtgg cttgccggtc gtaccgtcgt tcctgtgtca 480 

168 acgttaagcg ggccggagct ggcgttacaa aaattgggta aaacgccgtt aggacgctat 540 

170 ctgttcacat catcgacatt aacccgggac tttattgaga taggccgtga tgccgggctg 600 

172 tgggggcgac gttcccgcct gcgattaagc ggtaaaccgc tgttgctaac agaactgttt 660 

174 ttaccggcgt caccgttgta ctaa 684 

177 <210> SEQ ID NO: 8 

178 <211> LENGTH: 227 

179 <212> TYPE: PRT 

180 <213> ORGANISM: artificial sequence 

182 <220> FEATURE: 

183 <223> OTHER INFORMATION: Synthetic chloroplast-targeted CPL fusion protein
185 <400> SEQUENCE: 8 

187 Met Ala Ser Ser Val Ile Ser Ser Ala Ala Val Ala Thr Arg Ser Asn

188 1 5 10 15

191 Val Thr Gln Ala Ser Met Val Ala Pro Phe Thr Gly Leu Lys Ser Ser

192 20 25 30

195 Ala Thr Phe Pro Val Thr Lys Lys Gln Asn Leu Asp Ile Thr Ser Ile

196 35 40 45

199 Ala Ser Asn Gly Gly Arg Val Ser Cys Met Gln Val Trp His Met Ser

200 50 55 60

203 His Pro Ala Leu Thr Gln Leu Arg Ala Leu Arg Tyr Cys Lys Glu Ile

204 65 70 75 80

207 Pro Ala Leu Asp Pro Gln Leu Leu Asp Trp Leu Leu Leu Glu Asp Ser

208 85 90 95

211 Met Thr Lys Arg Phe Glu Gln Gln Gly Lys Thr Val Ser Val Thr Met

212 100 105 110

215 Ile Arg Glu Gly Phe Val Glu Gln Asn Glu Ile Pro Glu Glu Leu Pro

216 115 120 125

219 Leu Leu Pro Lys Glu Ser Arg Tyr Trp Leu Arg Glu Ile Leu Leu Cys

220 130 135 140

223 Ala Asp Gly Glu Pro Trp Leu Ala Gly Arg Thr Val Val Pro Val Ser

224 145 150 155 160

227 Thr Leu Ser Gly Pro Glu Leu Ala Leu Gln Lys Leu Gly Lys Thr Pro

228 165 170 175

231 Leu Gly Arg Tyr Leu Phe Thr Ser Ser Thr Leu Thr Arg Asp Phe Ile

232 180 185 190

235 Glu Ile Gly Arg Asp Ala Gly Leu Trp Gly Arg Arg Ser Arg Leu Arg

236 195 200 205

239 Leu Ser Gly Lys Pro Leu Leu Leu Thr Glu Leu Phe Leu Pro Ala Ser
240 210 215 220 

243 Pro Leu Tyr 

244 225 

247 <210> SEQ ID NO: 9 

248 <211> LENGTH: 34 
249 <212> TYPE: DNA

250 <213> ORGANISM: artificial sequence 

252 <220> FEATURE: 

253 <223> OTHER INFORMATION: Primer 

255 <400> SEQUENCE: 9 

256 ctactcattt gaagactgca tgcaggtgtg gcat 34 

259 <210> SEQ ID NO: 10 

260 <211> LENGTH: 34 

261 <212> TYPE: DNA 

262 <213> ORGANISM: artificial sequence 

264 <220> FEATURE: 

265 <223> OTHER INFORMATION: Primer 

267 <400> SEQUENCE: 10 

268 catcttactg tcgactttag tacaacggtg acgc 34 

271 <210> SEQ ID NO: 11 

272 <211> LENGTH: 37 
273 <212> TYPE: DNA

274 <213> ORGANISM: artificial sequence 

276 <220> FEATURE: 

277 <223> OTHER INFORMATION: Primer 

279 <400> SEQUENCE: 11 

280 ctactcattt ggccagctct gtcatttctt cagcagc 37 

283 <210> SEQ ID NO: 12 

284 <211> LENGTH: 31 
285 <212> TYPE: DNA

286 <213> ORGANISM: artificial sequence 
288 <220> FEATURE: 

289 <223> OTHER INFORMATION: Primer
291 <400> SEQUENCE: 12

292 catcttacta gatctttagt acaacggtga c 31

295 <210> SEQ ID NO: 13 

296 <211> LENGTH: 33 
297 <212> TYPE: DNA

298 <213> ORGANISM: artificial sequence 

300 <220> FEATURE: 

301 <223> OTHER INFORMATION: Primer 

303 <400> SEQUENCE: 13 

304 cccgggggta cctaaagaag gagtgcgtcg aag 33 

307 <210> SEQ ID NO: 14 

308 <211> LENGTH: 46 

309 <212> TYPE: DNA 

310 <213> ORGANISM: artificial sequence 

312 <220> FEATURE: 

313 <223> OTHER INFORMATION: Primer 

315 <400> SEQUENCE: 14 

316 gatatcaagc tttctagagt cgacatcgat ctagtaacat agatga 46 

319 <210> SEQ ID NO: 15 

320 <211> LENGTH: 62 

321 <212> TYPE: PRT 
322 <213> ORGANISM: artificial sequence

324 <220> FEATURE: 

325 <223> OTHER INFORMATION: Synthetic chloroplast-targeting sequence
327 <400> SEQUENCE: 15 

329 Met Ala Ser Ser Val Ile Ser Ser Ala Ala Val Ala Thr Arg Ser Asn
330 1 5 10 15

333 Val Thr Gln Ala Ser Met Val Ala Pro Phe Thr Gly Leu Lys Ser Ser

334 20 25 30

337 Ala Thr Phe Pro Val Thr Lys Lys Gln Asn Leu Asp Ile Thr Ser Ile
338 35 40 45

341 Ala Ser Asn Gly Gly Arg Val Ser Cys Met Gln Val Trp His

342 50 55 60 

345 <210> SEQ ID NO: 16 

346 <211> LENGTH: 170 

347 <212> TYPE: PRT 

348 <213> ORGANISM: artificial sequence 

350 <220> FEATURE: 

351 <223> OTHER INFORMATION: Processed chloroplast-targeted CPL synthetic fusion protein
353 <400> SEQUENCE: 16 



355 Met Gln Val Trp His Met Ser His Pro Ala Leu Thr Gln Leu Arg Ala

356 1 5 10 15

359 Leu Arg Tyr Cys Lys Glu Ile Pro Ala Leu Asp Pro Gln Leu Leu Asp

360 20 25 30

363 Trp Leu Leu Leu Glu Asp Ser Met Thr Lys Arg Phe Glu Gln Gln Gly

364 35 40 45

367 Lys Thr Val Ser Val Thr Met Ile Arg Glu Gly Phe Val Glu Gln Asn

368 50 55 60

371 Glu Ile Pro Glu Glu Leu Pro Leu Leu Pro Lys Glu Ser Arg Tyr Trp

372 65 70 75 80

375 Leu Arg Glu Ile Leu Leu Cys Ala Asp Gly Glu Pro Trp Leu Ala Gly

376 85 90 95

379 Arg Thr Val Val Pro Val Ser Thr Leu Ser Gly Pro Glu Leu Ala Leu

380 100 105 110

383 Gln Lys Leu Gly Lys Thr Pro Leu Gly Arg Tyr Leu Phe Thr Ser Ser

384 115 120 125

387 Thr Leu Thr Arg Asp Phe Ile Glu Ile Gly Arg Asp Ala Gly Leu Trp

388 130 135 140

391 Gly Arg Arg Ser Arg Leu Arg Leu Ser Gly Lys Pro Leu Leu Leu Thr

392 145 150 155 160

395 Glu Leu Phe Leu Pro Ala Ser Pro Leu Tyr

396 165 170

399 <210> SEQ ID NO: 17 

400 <211> LENGTH: 180 

401 <212> TYPE: PRT 

402 <213> ORGANISM: Solanum lycopersicum 
404 <400> SEQUENCE: 17 

406 Met Ala Ser Ser Val Ile Ser Ser Ala Ala Val Ala Thr Arg Ser Asn

407 1 5 10 15

410 Val Thr Gln Ala Ser Met Val Ala Pro Phe Thr Gly Leu Lys Ser Ser